Sharing the components of transposition-invariant distance, DIT, on DIT-organized Burkhard-Keller structure in searches for best matching strings

نویسندگان

  • Octavio Santana Suárez
  • José Rafael Pérez Aguiar
  • Zenón José Hernández Figueroa
  • G. Rodriguez
چکیده

In this work various construction character/frequency information sharing structure approaches are proposed in order to optimize transposition_invariant distance evaluation, [SD87], that distance is used to construct a Burkhard-keller tree, [BK73] and [NK82], where is organized a dictionary of strings token over a characters alphabet [SP88], to achieve searchings of strings best matching one on Levenshtein sense [LE66]. 0.-INTRODUCTION: This work is on the line of optimizing search schemes for the strings best matching one and data structures standing them. Given a strings dictionary, a distance defined on the strings space and a search string (belonging or not to the dictionary) the question is to find all strings at minimum distance of the searching string. The aim of this work is to organize the character/frequency information from the dictionary strings in such a way that it becomes possible to improve the transposition_invariant distance evaluation, using former evaluations by means of sharing the character/frequency information with different strings. In section 1 the distance used is the directional distance, DD, introduced by Levenshtein, [LE66], evaluated by Wagner and Fisher, [WF74], and whose optimized evaluation algorithm is described in [SP88]. In section 2 appears a Transposition_Invariant distance, DIT, with a lower cost than DD, which is used as an adaptive filter to searchings [SD87]. DITE+DD search scheme [SD87] is described in section 3. Section 4 discusses BK_DIT+DD search scheme [SP88] achieved over a Burkhard and Keller (BK) tree structure, [BK73] and [NK82]. DIT components sharing structure is studied in section 5. In section 6 various construction approaches for that structure are proposed. Section 7 suggests an approach for the BK tree construction according to the sharing structure. Section 8 shows experimental results and conclusions of this work. µ, the nil string. ³ ³X³ ³ is the length of X. An editing operation is a pair (ß,Ω Ω)=,/(µ µ,µ µ), where ß and Ω Ω are strings of length less or equal than one, i. e., they are µ µ or they are a single character. String Y results of the application of (ß,Ω Ω) on X if X=σ σßΦ Φ and Y=σΩΦ σΩΦ, that is written X->Y.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Increasing Radius Search Schemes for the Most Similar Strings on the Burkhard-keller Tree

In this work search schemes are proposed for the most similar strings to a given one, on the sense of the Levenshtein directional distance, working on a Burkhard_Keller structure, [BK73] and [NK82], organized by the transposition_invariant distance, [SD87], using a increasing search radius as opposed to the decreasing search radius schemes, [SP88]. Some organization approaches are studied to fi...

متن کامل

ارائه روش پرتودرمانی تطبیقی جدید تک ایزوسنتریک در پرتودرمانی خارجی دیواره سینه بیماران مستکتومی شده

Introduction and purpose: The aim of this study is introducing a new Mono-Isocentric Technique)MIT) for external  radiotherapy of the chest wall in the mastectomy patients with 6MV photon beams and comparing this technique with the Dual-Isocentric Technique(DIT). Materials and methods: Data of 18 mastectomy patients having chest wall radiotherapy were used. The target tissue was divided into...

متن کامل

Restricted Transposition Invariant Approximate String Matching Under Edit Distance

Let A and B be strings with lengths m and n, respectively, over a finite integer alphabet. Two classic string mathing problems are computing the edit distance between A and B, and searching for approximate occurrences of A inside B. We consider the classic Levenshtein distance, but the discussion is applicable also to indel distance. A relatively new variant [8] of string matching, motivated in...

متن کامل

Transposition invariant string matching

Given strings A = a1a2 . . . am and B = b1b2 . . . bn over an alphabet Σ ⊆ U, where U is some numerical universe closed under addition and subtraction, and a distance function d(A,B) that gives the score of the best (partial) matching of A and B, the transposition invariant distance is mint∈U{d(A+ t,B)}, where A+ t = (a1 + t)(a2 + t) . . . (am + t). We study the problem of computing the transpo...

متن کامل

Comparing the monoisocentric and dual isocentric techniques in chest wall radiotherapy of mastectomy patients

The monoisocentric (MIT) and dual isocentric (DIT) techniques are compared for the mastectomy patients undergoing chest wall radiotherapy, and a new practical method is suggested for determining the dose calculation reference point to be used in the MIT. Data of 18 mastectomy patients having chest wall radiotherapy were used. To find the appropriate dose calculation reference point for the MIT,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989